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We propose a class of nonlocal boxes named functional boxes which include a generalization of 
the Popescu-Rohrlich(PR) box as a special case. We show that every functional box corresponding 
to an additively inseparable function can make communication complexity trivial and thus seems 
unlikely to exist in nature. Then we relate general nonlocal boxes to these functional boxes, and 
derive several limits on them from the principle of information causality. 
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I. INTRODUCTION 

Quantum mechanics allows distant parties to estab- 
lish certain correlations that arc impossible in the classi- 
cal world. These correlations are nonlocal, in the sense 
that they violate Bell inequality [TJ |2J. However, as first 
shown by Tsirelson 3] , their violation is still bounded. In 
a seminal paper, Popescu and Rohrlich |4J proposed a 
hypothetical nonlocal correlation that attains the maxi- 
mal value for the Clauser-Horne-Shirmony-Holt(CHSH) 
inequality^ but still cannot be used to signal from one 
party to another. Understanding why such stronger- 
than-quantum correlations have not been observed in na- 
ture has since then become an active topic of research 
[51131]. Recently, several information-theoretical princi- 
ples were proposed as candidates that separate physically 
realizable correlations from nonphysical ones. In this pa- 
per, we will be concerned with two of them — nontrivial 
communication complexity [5J and information causality 
[18] . Both of them show that allowing certain postquan- 
tum correlations would lead to implausible simplification 
of communication tasks. 

In one line, van Damp] showed that, equipped with 
many copies of Popescu- Rohrlich(PR) boxes[l], every 
communication complexity problem can be solved de- 
terministically by a single bit of communication. Then, 
Brassard et aL[10] extended this result by proving that 
if PR boxes can be implemented with probability greater 
than (3 + y/6)/6 « 90.8%, then the probabilistic com- 
munication complexity also collapses. These results were 
further generalized by Refs.[7] ITS ] IT9"]. 

In another line, Pawlowski et al. [T5] considered a sce- 
nario in which Alice had a database of N independent 
and random bits, and a distant party, Bob, was asked 
to guess a the fc-th bit in Alice's database for random 
k € {0, 1, . . . , N — 1}. They suggested a principle stating 
that Bob can gain at most m bits of information about 
Alice's database by using his local resources and receiving 
m bits from Alice. This principle was named information 
causality. When m — 0, it reduces to no-signalling. It 
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was demonstrated that using many copies of PR boxes 
Bob can correctly guess any bit of Alice with certainty 
by receiving only 1 bit from her. Moreover, any correla- 
tion exceeding Tsirelson's bound for the CHSH inequality 
violates information causality. The implications of infor- 
mation causality on nonlocality were further explored in 
Refs. [23 [27H3T] . 

A common feature of these two proposals is that the 
PR box perfectly cracks the information processing task 
under consideration. It also exhibits an extremely strong 
power for other tasks [HI 114] . The reason for the suc- 
cess of this box can be summarized as follows. Essen- 
tially, the aforementioned tasks can be viewed as that 
Alice and Bob want to compute some function f[x, y) in 
some distributed way, where x and y are initially held 
by Alice and Bob respectively. Note that any boolean 
(or arithmetic) function can be computed by a circuit 
consisting of XOR (or addition) and AND (or multipli- 
cation) gates. These two types of gates generally do not 
commute. However, using a PR box, AND operations are 
"transformed" into XOR operations, in the way that the 
AND of two inputs is encoded as the XOR of two outputs. 
So Alice and Bob effectively convert the original circuit 
into a "circuit" consisting of only XOR operations, which 
are commutative. Then Alice and Bob can reorder their 
operations, compact them, and minimize the interaction 
between them. This argument was made explicit in the 
discussion of communication complexity [5 . But it can 
also explain the success of PR box for the information 
causality task, since that task can be viewed as the com- 
munication complexity problem for a special function - 
the index function f(x, y) = x y , with the extra condition 
that the communication is only from Alice to Bob. 

In light of the above observation, we introduce a class 
of nonlocal boxes called functional boxes. This class of 
boxes include a generalization of PR box as a special 
case. A functional box's inputs and outputs have p pos- 
sible values which are identified with the elements of Z p , 
where p can be any prime number. This box encodes the 
value of a function / : Z p x Z p -> Z p over its inputs as 
the difference of its outputs. In particular, the PR box 
is just the functional box corresponding to f(x,y) = xy 
and p — 2. We find that, as long as / is not additively 
separable (see section 2 for definition), the corresponding 
functional box is asymptotically equivalent to a general- 
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ized PR box. (More specifically, an additively inseparable 
function contains a component that is about the multi- 
plication of its two variables, and we show that this com- 
ponent can be isolated out by using a difference method, 
and hence the corresponding functional box can be trans- 
formed into a generalized PR box.) We also prove that a 
generalized PR box can enable perfect distributed com- 
putation and make communication complexity trivial. As 
a result, any functional box equivalent to it has the same 
power and is unlikely to exist in nature. Then the next 
question is how well these functional boxes might be ap- 
proximated. We derive several bounds on the proxim- 
ity between a plausible p- nary- input, p- nary-output box 
and these functional boxes, from the principle of infor- 
mation causality. In order to do this, we extend the basic 
and nested protocols in Ref.[TH] to the p-nary digit case. 
Besides, we also present a generalization of the depolar- 
ization process that transforms any binary-input binary- 
output box into an isotropic one |lll 152"] . which might be 
of independent interest. 



II. FUNCTIONAL BOXES AND 
COMMUNICATION COMPLEXITY 

Let us first review several basic definitions about non- 
local boxes. 

Definition 1. A bipartite correlation box (or simply 
a box) is a hypothetical device shared by two spatial 
separated parties Alice and Bob that receives an input 
x £ X from Alice and an input y £ y from Bob, and 
outputs a £ A to Alice and b £ B to Bob, according 
to a joint probability distribution P(a, b\x, y). Without 
causing ambiguity, we also call this box P. 
If P satisfies 

£P(a, b\x,y) =Y, p {a,b\ x iV') = p ( a \ x ): Va,^,y,y'; 

b b 

J2P(a,b\x,y) =J2P(a,b\x',y) = P(b\y), Vb,x,x',y, 

a a 

(i) 

then it is no-signalling. Namely, Alice cannot signal to 
Bob via her choice of input to this box and vice versa. 

If Alice and Bob can simulate P using shared random- 
ness (without communication between them), then P is 
local. Otherwise, it is nonlocal. 

For example, the standard PR box is given by 

1 

2 

0, otherwise 



PR{a,b\x,y) = < 2 , (2) 



where a,b,x,y £ {0,1}. Note that PR(a\x,y) — 

PR(b\x,y) = -, \/a,b,x,y. So this box is no-signalling. 

In this paper, we focus on no-signalling boxes for which 
\X\ = \y\ = \A\ = \B\ = p, where p can be any prime 
number. Without loss of generality, we assume X = y = 



A = B = {0, 1, ... ,p — 1} = Z p , and all the following 
computation related to x, y, a, b is modulo p. 

Definition 2. A function F : T p l x Z™ Z p is distribut- 
edly computed by Alice and Bob if, when Alice is given 
any x — (xq,xi, . . . ,x n -i) £ Z™ and Bob is given any 
V = (j/o,2/i, ■ ■ ■ ,2/m-i) S Z™, Alice can produce a £ Z p 
and Bob can produce b £ Z p such that a — b = F(x, y). 

If F can be distributedly computed with certainty (or 
with constant probability), then the deterministic (or 
probabilistic) communication complexity of F becomes 
trivial, since Alice can simply send a to Bob and then 
Bob can calculate a — 6 = F(x, y). 

Let us first consider the following box, which is a 
straightforward generalization of standard PR box to the 
p-nary input/output case: 



Definition 3. 
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if a — b = xy 

PRp{a,b\x,y) = \ P 

0, otherwise 



(3) 



where a, b,x,y £ Z p . 



In particular, the standard PR box is just Pi?2- Note 

that PR p {a\x, y) — PR p (b\x,y) = -, Va, 6, x,y. So PR P 

is no-signalling. 

Given arbitrarily many copies of PR P , Alice and Bob 
will be able to distributedly compute any function F : 
Z™ x Z™ 1 — !• Z p perfectly. To prove this, fist note that 
F(x, y) can always be written as a multivariate polyno- 
mial whose degree in each Xj or yj is no larger than p— 1: 

p— 1 p— 1 n— 1 m—1 

H^y) = E e c n y? 

ao,— ,an-i=Oj8 o ,...,0m-i=O i=0 j=0 

(4) 

for some /i- ^ £ Z p , where a = (a Q , ax, . . . , a„_i) and 

P = (/3oj/3ii • • • lAn-i)- So Alice and Bob can execute 
the following protocol: for each (a, (3), they use a PR P as 

n— 1 m—1 „ 

follows: Alice inputs Y[ X T an d Bob inputs Y[ Vj > an d 

i=o j=o 
suppose they get outputs a s g and b 3 s respectively. In 

the end, Alice sets a — ^2 E A* 5 p a s 3 as ^ e:r nna ^ output, 

5 3 

and Bob sets b — Y^, E Ms 3^s 3 as ms nna l output. It is 

5 3 

easy to verify a — b — F(x, y). 

Now let us consider a wider class of boxes: 

Definition 4. For any function / : Z p x Z p — > Z p , the 
functional box corresponding to / is defined as 
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if a 



P f (a,b\x,y) = \ p 

( 0, otherwise 

where a, b,x,y £ Z p . 



f(x,y) 



(5) 



3 



Namely, Pf can be directly used to distributedly com- 
pute /. In particular, PR P can be viewed as the func- 
tional box corresponding to f(x, y) — xy. Note that 

Pf(a\x, y) — Pf(b\x,y) = -, Va,b,x,y. So Pf is also 

no-signalling. 

Definition 5. A bivariate function / : Z p x Z p — > Z p 
is additively separable if there exist univariate functions 
g, h : Z p — »■ Z p such that f(x,y) = g{x) + h(y), Vx,y € 
Zp. Otherwise, / is additively inseparable. 

Definition 6. Suppose / : Z p x Z p — ► Z p can be written 
as f(x,y) — ^i,i x% V^ f° r some Ajj G Z p . Define 

i,j=0 

A (f) = , mf A^iMi + j)}, (6) 

where I is the indicator function (i.e. Ie = 1 if E is true, 
and otherwise). Namely, A(/) is the maximum of the 
degrees of the terms in / that can be divided by xy. 

Obviously, a function / is additively inseparable if and 
only if A(/) > 2. Only an additively inseparable / con- 
tains a term that is related to the product of x and y. So 
one may naturally wonder if P* can be used to simulate 
PR p . If so, then pf can also benefit distributed compu- 
tation. We find that it is indeed the case, provided we 
are given sufficiently many copies of Pf . 

Definition 7. We use the notation P\ — > P 2 to denote 
the fact that we can use N copies of Pi to simulate a 
P 2 exactly, for some N > 1. We also use Pi o P 2 to 
denote that Pi — > P 2 and P 2 — > Pi, i.e. Pi and P 2 are 
asymptotically interconvertible. 

Lemma 8. ///:Z p xZ p ->Z p is additively separable, 
then Pf is local. Otherwise, Pf PR p . 

Proof: (l)Suppose f(x,y) — g(x) + h(y) for some g, h : 
Zp — > Zp, then we can build a local model for Pf as 
follows: Alice and Bob first generate a uniformly random 
variable z € Z p , then Alice outputs a = g(x) + z and Bob 
outputs b = —h(y) + z. 

(2) Suppose / is additively inseparable. We first show 
PRp — > Pf . Recall that we have already shown how to 
use many copies of PR p to perform distributed compu- 
tation of any functions, including /. So Alice and Bob 
can first use that protocol to obtain a and b such that 
a — b = f(x,y). Then they generate a uniformly ran- 
dom z € Zp and modify their outputs by a — >• a + z and 
b^b + z. 

It remains to show P? — > PR p . Proof by induction on 
A(/): 

• Base case: A(/) = 2. In this case, we have 

f(x, y) = Xxy + g(x) + h(y) (7) 

for some A ^ and g, h : Z p — > Z p . We can use 
a P f to simulate a PR P as follows: Alice inputs x 



and Bob inputs y to P? , and suppose they obtain 
outputs a and b respectively. Then Alice sets 

a' = X~ 1 (a — g(x)) (8) 
as her final output, while Bob sets 

V = \- 1 (b + h(y)) (9) 

as his final output. Then we have 

a'-b' = \- 1 {a-b-g{x)-h{y)) 

= X-\f{x,y)-g{x)-h{y)) (10) 
= xy. 

Furthermore, since a (or b) is uniformly random, a' 
(or b') is also uniformly random conditioned on any 
(x,y). 

• Inductive step: Suppose P^ — > PR P for any / with 
A(/) = k for some k > 2. Consider any / with 
A(/) = k + 1 > 3. Such f(x, y) (viewed as a bivari- 
ate polynomial) contains a term that is a multiple 
of x 2 y or xy 2 . We deal with the two cases sepa- 
rately. 

In the first case, consider 

fW(x,y)^f(x + l, y )-f(x,y). (11) 

Compared to /, the degree of in x is decreased 
by 1, while its degree in y is the same or smaller. 
Moreover, it is easy to see 

A(/«) = A(/) - 1 = k > 2. (12) 

f (!) 

Thus by induction P' — > PR p . Furthermore, 

ft 1 ) f 
P 1 can be simulated with two copies of P' as 

follows: for the first pf , Alice inputs x + 1 and 

Bob inputs y, and assume they receive outputs ai 

and bi ; for the second P^ , Alice inputs x and Bob 

inputs y, and assume they receive outputs a 2 and 

&2- Then Alice sets a = ai — a 2 as her final output, 

and Bob sets b = bi — b 2 as his final output. Then 

we have 

a - b = (ai - a 2 ) - (&i - 62) 

= (ai -61) - (o 2 -62) n ,x 
= /(* + 1,2/) -/(*, </) 

= / (1) (^y)- 

So Pf -> P/ (1) -> PPp. 

A similar argument holds for the second case. But 
in this case, we consider 

fW(x,y)^f(x,y + l)-f(x,y), (14) 

which satisfies 

A(/( 2 )) = A(/) - 1 = k > 2. (15) 

f ( 2 ) 

Then by induction PJ — > PR p . Furthermore, 
can also be simulated with two copies of Pf . 
Therefore we have Pf -> P /<2> -> PP„. 
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Remark 9. This proof actually yields a recursive strat- 
egy to simulate PR P with 2 A< -^~ 2 copies of . To be 
specific, we have shown 

pi ph _> p/a _> > pU ^ PRpi (i 6 ) 

where fi(x,y) = /;_i(x + l,y) - f i _ 1 (x,y) or fi(x,y) = 
f i - 1 (x,y+ 1) - fi-i(x,y). The strategy is to use a P /l 
to simulate a PR P , where this P^ L is simulated with two 
copies of P^ i_1 , where each copy of pi 1 -- 1 is simulated 
with two copies of P? 1 -- 2 , and so on. Overall, 2^^ -2 
copies of Pf is used, since A(/j_i) — A(/j) = 1 and 
A(/ L ) = 2. 

This strategy can be demonstrated by the following ex- 
ample. Suppose p = 3 and / : Z3 x Z3 — >• Z3 is given 
by 

f(x,y) = x 2 y 2 + 2xy 2 + xy + 2x. (17) 

Using two copies of P* , we can simulate a P-^ 1 where 

h(x,y) = f(x + 1, y) - f(x, y) , ■. 

- 2xy 2 + y + 2. 

Then using two copies of P-* 1 (each of which is simulated 
with two copies of P?), we can simulate a P-^ 2 where 



(19) 



/afoy) = fi(x,y + l) - fi(x,y) 
= xy + 2x + 1 

Finally, we can simulate a PP p with a P-* 2 by Alice sub- 
tracting her output by 2x + 1. Overall, four copies of P^ 
is used to to simulate a PR„. 



A. Generalized Depolarization Process 

It is well known that any binary- input, binary-output 
box can be transformed into an isotropic one: 



P lso (A) = APP 2 + (l-A)Pv 



(20) 



where P/v is the completely random noise (i.e. 
Pjv(o, b\x,y) = -, Va,b,x,y G Z 2 ), and Pi? 2 is the anti- 
PR box (i.e. PR2(a, b\x, y) — — if a + b = xy + 1, and 

otherwise, Va, 6, € Z2), via the so-called depolar- 
ization process [TTJ [32]: Alice and Bob generate three in- 
dependent and uniformly random bits a, /? and 7, and 
modify their inputs and outputs by 



X — > a; + a, 

2/ -+y + P, 

a — > a + fix + a/? + 7, 

6 — > 6 + ay + 7. 



(21) 



Here we prove an analogue of this result for p-nary- input, 
p- nary-output boxes. 



Definition 11. 



PR Ptj (a,b\x,y) = { p' 



if a — b = xy — j 



(22) 



0, otherwise 



Hence there are only two inequivalent classes of func- 
tional boxes with respect to asymptotic transformation: 
those corresponding to additivcly separable functions are 
local and cannot benefit distributed computation, while 
the others are all equivalent to PR P and can enable per- 
fect distributed computation and make communication 
complexity trivial. Therefore, we have 

Theorem 10. In any world where communication com- 
plexity is not trivial, the functional box P' corresponding 
to any additively inseparable function f : Z p x Z p Z p 
cannot be implemented perfectly. 



III. LIMITS ON NONLOCALITY FROM 
INFORMATION CAUSALITY 



where a,b,x,y,j £ 7L P . 

Lemma 12. Given any p -nary-input, p-nary-output box 
P , define 

1 p-i 

fij = -2 J2 P(a-b = xy-j\x,y), (23) 

P x,y=0 



wht 



p-l 



P(a -b = xy- j\x,y) = £ P(a = k,b = k- xy + j\x,y). 

fc=0 

(24) 

Then 



p-i 

3=0 



(25) 



In the previous section, we have shown that an ex- 
act implementation of P* for any additively inseparable 
function / is impossible, unless communication complex- 
ity collapses. So now the question is how well these boxes 
might be implemented in nature. In this section, we par- 
tially answer this question by deriving several bounds 
on the proximity between any plausible p- nary- input, p- 
nary-output box and these functional boxes, from the 
principle of information causality. But before doing that, 
we need to first present the following result which is a key 
ingredient to our analysis. 



Proof: Consider the following protocol: Alice and Bob 
generate three independent and uniformly random vari- 
able a, /3,7 € Z p . They input 



x = x + a, 

y' =y + P 



(26) 



to box P. Suppose they obtain outputs a! and b' . Then 
they set 



a = a' — fix — a/3 + 7, 
b = b' + ay + 7 



(27) 
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as their final outputs. 

Suppose this protocol realizes a box P. Then for any 
given a,b,x,y, 

P{a,b\x,y) = \ I) P(a\b'\x',y'). (28) 

P a,0,y=O 

Note that 

a! — V — x'y' = (a + j3x + aft — 7) — (b — ay — 7) 
-(x + a)(y + ft) 
= a — b — xy. 

(29) 

Besides, (x',y') are uniformly random in Z p x Z p ; a' (or 
b') is also uniformly random in Z p conditioned on any 
{x' ,y'). So, if a — b = xy — j, then 

P(a,b\x,y) = ^ E P(a'-b' = x'y'-j\x',y') 

V x',y'=0 



which implies 



P 



(30) 
(31) 



where I{xi : b\y = i) is the Shannon mutual information 
between Xj and 6, under the condition that Bob receives 
y = i. Note that if P(b — X{\y = i) = Pi, then by Fano's 
inequality, 

jV-l N-l 

7>Mog 2 p- E E (l-Pi)loga(P-l) 

i=0 i=0 

(33) 

where h(x) — —x\og 2 x — (1 — x)log 2 (l — x) is the binary 
entropy function. 

The principle of information causality states that for 
any physically allowed theories, we must have 



I < mlog 2 p. 



(34) 



Both classical and quantum correlations satisfy this con- 
dition. However, it is unknown whether all postquantum 
correlations violate this condition. 

Now suppose Alice and Bob share unlimited number 
of copies of a p-nary-input, p-nary-output box P, and 
Alice is allowed to send only one p-nary digit to Bob, 
i.e. m = 1. We are going to investigate how P can help 
them in this task, and presents several limits on P from 



condition (34). 



We will consider the cases of N < p and N > p sepa- 
rately. 



Hence, any p-nary-input, p-nary-output box can be 
transformed into a probabilistic mixture of PRpj's, each 
of which is essentially equivalent to PR P up to an addi- 
tive shift of the outputs. 



Remark 13. Lemma 12 can be straightforwardly gener- 
alized to the case of any g-nary input/output box where 
q does not have to be prime. 



B. Limits on Nonlocality from Information 
Causality 

Let us briefly review the principle of information 
causality [TS]. It was introduced via the following com- 
munication task, which is similar to a random access 
code [53] or oblivious transfer [51 Suppose Alice 

and Bob are two spatially separated parties. Alice re- 
ceives a string of N random and independent p-nary dig- 
its x = [xq , 11, . . . , xn- 1 ) € lip . Bob receives a random 
variable y £ {0,l,...,iV — 1} and is asked to give the y-th 
digit of Alice. To achieve this, they may share in advance 
some no-signalling resources such as shared randomness, 
entangled states or nonlocal boxes. Besides, Alice is al- 
lowed to send at most m p-nary digits (or equivalently, 
mlog 2 p bits) to Bob. Let us denote Bob's output by b. 
The degree of their success is quantified by 



N—l 

1= E I{xi-.b\y = i) 



(32) 



Case 1: N <p 

In this case, Alice receives x = (xi,X2, ■ ■ ■ ,xn) C 
and Bob receives y G {0, 1, . . . , N — 1} C Z p . Bob aims 
to obtain the value of 



F(x, y) = Xy 



(35) 



Note that 



JV-l 



F(S,y)=J2[ n (i-3)-Hv-3)]xi (36) 

*=0 0<j^i<N~l 

for any x e 1^ and y £ {0,1, ... , N-l} C Z p . Moreover, 
the right-hand side of the above equation has degree N—l 
in y. So F{x,y) can be rewritten as 



N—l 

F(x,y)= E y k Fk(x) 

k=0 



(37) 



for some Fh : Z 



Z p . This fact suggests a protocol 



that generates each term y k F}~(x) independently and 
then sums them together. For PR P , this can be achieved 
by Alice inputting Fk(x) and Bob inputting y k , then 
the difference between their outputs would be y k Fk(x). 
For a general box P, by lemma [12J it can be converted 
into a mixture of PR P j boxes which can be viewed as 
an imperfect PR P box with random additive noise. As 
long as the random noise is not very bad, we can still 
pretend it to be a PR p and achieve a high efficiency So 
consider the following protocol: 
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Protocol 1 basicRAC(p, N, c, P, x, y) 

Setup: p is prime. N < p. Alice has x € Zp and Bob 
has y £ {0, 1, . . . , N - 1}. They share at least TV - 
1 copies of a p-nary-input, p-nary-output box P. 
They also choose c £ Z p . 

Steps: 1. Alice and Bob convert each copy of P into a copy 



of P = ^ [ijPR P: j (where fj,j is given by Eq.(23l), 

3=0 ' I 

using the protocol given in the proof of lemma jl2| 

2. Alice and Bob use N — 1 copies of box P as fol- 
lows: for the fc- th b ox, A lice inputs F k (x) (which 
is given by Eqs.(36l and ( 37 1 ) and Bob inputs y k , 
and suppose they get outputs a k and b k respec- 
tively, Vfe = 1,2, ...,N-1. 

N-l 

3. Alice sends q — ^ a k + Fo(x) to Bob. 

fe=i 

JV-l 

4. After receiving q, Bob outputs b = q— b k — c. 



(X(,,X lt ... 6 2p 
_l 



ye {0,1 W-l} 

i 



«2 



fjV_!(S) 



♦ t 



FIG. 1: Basic Protocol for N < p. Each black box represents a 
copy of P that is obtained f rom a copy of P using the protocol 
given in the proof of lemma 12 Fq(x), Fi(x), . . . , Fn-i(x) are 
given by Eqs.(36l and (371. 



For an illustration of this protocol, see Fig. [T] 

Remark 14. When p = 2 and N = 2, we have 
F(xo,xi,y) — xq + (xq + xi)y and hence Fx(xq,xx) = 
x + xx, F Q (xo, Xi) = Xq. Then the above protocol with 
c = reduces to the basic protocol in Ref . [T5] . 

Let us analyse the efficiency of this protocol. 
First, consider the special case of P = PR p and c = 0. 
In this case, step 1 does not have any effect on PR P , and 



we have P — PR P . So in step 2, we have 

a k -b k = F k (x)y k , Vfc. (38) 
As a result, Bob's output 

JV-l JV-l 

b = q- E h= J2 {a k -b k )+F (x)=F(x,y), 
k =i fc=i 

(39) 

So Bob correctly guesses x y with certainty. 

However, by lemma 12 P is generally not PR P , but 
equals PR P j with probability /ij, Vj € Z p . Then 

a k -b k = F k (x)y k - j k 

where 

P(jk=j) = fJ. 3 , VjeZp. 
And Bob's output is 



(40) 
(41) 



N-l 

b = F(x, y) - Jk - c, 
fc=i 



which is correct if and only if 



N-l 

E Jk = -C, 
k=l 



which happens with probability 
X P (P,N- l,c) 



N—l 

e n 

JiH h?V-i=-c k=l 



(42) 



(43) 



(44) 



where /2 = (/io, fix, ... , /i p _i). Note that this probability 
is independent of y. So by Eq.(33), we must have 



N - 1 

Xp {fl,N-l,c) < e-i(_^lo g2 p), (45) 



where 



0p(a;) = fc(a;) + (1 - a;)log 2 (jj - 1) 



(46) 



and p x (-) is its inverse function |35j . for condition (34) 
to be satisfied. 

Note that Vc e Z p , VM > 1, 



M 

x P {p,,M,c) = e n H k 

ilH hjj\/=-c fe=l 

= -tr(Z p c (EV^) M ) 

- - p e<(eV,o m - 

P fe=o i=o 



(47) 



where Z p = diag(l, lu p , . . . , ujP x ) is the p-dimensional 
generalization of Pauli Z matrix, w p = e i27r / p , and in 
the second step we use tr(Z*) = 0, Vi = 1, 2, . . . ,p — 1. 
Therefore, 

Theorem 15. In any world where information causality 
holds, any p-nary-input, p-nary-output box P satisfies: 

ViV < P , Vc e Z p , 

i p-i p-l AT _ i 

" E <(E MiO^ 1 < e-H-r^log^), (48) 
P fe=o j=o JV 



where fij is defined as Eq. ( ^3 ) 
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Case 2: N > p 



n-l 

Assume N — p n and y — ViP % f° r Vi G Z p . We 

i=0 

present a recursive protocol that calls the basicRAC pro- 
tocol as a subroutine. It is the p-dimensional generaliza- 
tion of the one given in Ref . [T5] . 



Protocol 2 recursiveRAC(p, N, c, P, x, y) 

Setup: p is prime. N — p n for some n > 1. Alice has 

n— a 

k £ and Bob has y = ViP* where yt £ Z p . 

»=o 

They share at least N — l copies of a p-nary-input, 
p-nary-output box P. They also choose c £ Z p . 
Steps: If TV = p, then Alice and Bob execute 
basicRAC (p, TV, c, P, x, y). Otherwise: 

• Alice: she divides her input x into 
— substrings: zo = (xo,xi, 



Z\ = (Xp,Xp+l, . . . ,X2p-l), ZN_ 1 = 

p 

(x N -p,x N - p+1 , . . . ,x N -i). For k = 
0, 1, . . . , — — 1, she executes her part of 
basicRAC(p,p, c, P, Zh, yo), except that she 
does not send her message (which is de- 
noted by qk). Then she executes her part 
of recursiveRAC(p, ^, 0, P, q, y'), where q — 

n—l 

(9o,9i, • • • ,q*L-i) and y' = yip 1 ' 1 ■ 



• Bob 



he executes his part of 
TV 

recursiveRAC(p, — , 0, P, q, y'). (Here 
n-l 

still y' = yiP 1 ' 1 ■) Suppose the output 

i=l 

from this protocol is q. Then he executes 
his part of basicRAC (p, p, c, z y t , yo), except 
that he uses q as the message received from 
Alice. The output from this protocol is set 
as his final output. 



Fig. [2] illustrates an example of this protocol for p = 3 
and N = 9. 

The recursiveRAC protocol can be viewed as a level-n 
pyramid of basicRAC protocols. Its idea is that Alice 
uses level k + 1 to transmit her messages generated at 
level k, while Bob uses level k + 1 to reveal the mes- 
sage he needs at level k, Vfe = 0, 1, . . . , n — 1. Although 

n-l 

Alice accesses totally (p — 1) ^ p k = N—l copies of 

fc=0 

P, Bob only accesses n(p— 1) copies of P and only these 
boxes are truly relevant to his final output. Each of these 
boxes contributes to his final output an additive shift 
that equals j with probability fij, Vj £ Z p . So Bob's fi- 
nal guess is correct if and only if all these additive shifts 



S a {x ,x 1 x 8 ) e ; 

Alice 

Level 0: 



y = Oiyo) 3 e {o,i 8} 

Bob 



X Q — ( x 0> x l> x 2) 

A 



1 

A 



yo 

.A. 



BasicRAC 



1 

x 2 — ( x 6- x 7- x 8) 

A 



1 

A 



FIG. 2: Recursive protocol for N > p. Here we show an 
example for p = 3 and N — 9. Suppose Alice receives 
x = (xo, xi, . . ■ , xs) £ Z3 and Bob receives y = (2/11/0)3 £ 
{0, 1, . . . ,8}. They also share at least N — 1 = 8 copies of 
some box P. The protocol consists of three level-0 and one 
level-1 executions of the basic protocol illustrated in Fig[T] 
For Alice, she executes the three level-0 basic protocols with 
inputs (xo, 35i, X2), (x3,X4,X5) and (xq, X7, x$) respectively. 
Suppose her messages (which are not really sent) are qo, 91, 92 
respectively. Then she executes the level-1 basic protocol with 
input (go, 91, 92), and truly sends her message for this one. For 
Bob, he executes the level-1 basic protocol with input j/i, and 
suppose his output is q. Then he executes only the j/i-th 
level-0 basic protocol with input j/o(this figure shows the case 
j/i = 1). During this protocol he pretends that q is the mes- 
sage received from Alice. At last, this basic protocol's output 
b is set as Bob's final output. Note that each basic protocol 
costs p — 1 = 2 copies of box P, so Alice totally accesses 8 
copies of P, while Bob only accesses 4 copies of P. 



sum to — c, which happens with probability 

ra(p-l) 

X P (P,n(p- l),c) = E 11 Mj fe 

jl+---+ln(p-l)=-C k = l / 4 gN 
1 p— 1 p— 1 ^ ' 

P k=0 j=0 



So, by Eq.(33), we must have 



N-l, 



Xp (fl,n(p-l),c) < e-H^^h^p), (50) 



otherwise condition (34 1 is violated. Thus, 



Theorem 16. In any world where information causality 
holds, any p-nary-input, p-nary-output box P satisfies: 
Vn > 1, N = p n , Vc € Z p , 



to P, and Bob inputs 



1 p-l p-l ]W _ 1 

- E <'( E wi k ) nip - x) < ^(^^p), (51) 

P k=0 j=0 ly 



where fij is defined as Eq.(23). 



1. Bounds with respect to general functional boxes 

Theorems [15] and [16] can be viewed as giving bounds 
on the proximity between a plausible p-nary-input, p- 
nary-output box and PR p . By lemma M pf and PR P 
are interconvertible for any additively inseparable func- 
tion /. So there should be also bounds on the proximity 
between a plausible p-nary-input, p-nary-output box and 
the corresponding functional box. In what follows, we 
will give several such bounds. 

Definition 17. For any function / : Z p x Z p — > Z p , 

define 

f , , f — , if a — b = fix, y) — j , , 

Pj(a,b\x,y) = i p i (52) 

I 0, otherwise 
where a, 6, a;, y, j € Z p . 

P^ and pf are essentially equivalent, except for an 
additive shift of the outputs. 

We will consider the case of A(/) = 2 and A(/) > 2 
separately. 

Case 1: A(/) = 2. 
Suppose 



f{x, y) = Xxy + g(x) + hiy) 



(53) 



for some A 7^ and g,h : Z p Z p . 

Given any p-nary-input, p-nary-output box P, define 

= i E P(a-b = fix, y) - j\x, y) (54) 



where 



P x,y=0 



P(a- 6 = /(x,y) - j|x,y) 

E 1 p(a = fc,& = fc-/(.x,y)+j|.T, 2 ;). ,:m1 

fc=0 



Then we have 



E ' 



(56) 



To prove this, consider the following protocol: Alice and 
Bob generate three independent and uniformly random 
variable a, 0, 7 € Z p . Alice inputs 



(58) 



to P. Suppose they receive outputs a' and b' respectively. 
Then Alice sets 

a = a' - XfJx - Xaf3 - g(x + a) + g(x) + 7 (59) 

as her final output, and Bob sets 

b = b' + Xay + hiy + 0) - hiy) + 7 (60) 

as his final output. Suppose this protocol realizes a box 
P. Then for any given a, b, x, y, 

?ia,b\x, y ) = 1 P E Pia',b'\x',y'). (61) 

P q,/3,7=0 

Note that 

a' - b' - f{x',y') = (a + X/3x + Xa(3 + gix + a) - g(x) 
-7) - (6 - Xay - hiy + (3) + hiy) 
-7) - (A(x + a)iy +J3) + gix + a) 
+h(y + f3)) 
= a-b- f {x,y). 

(62) 

Besides, ix',y') are uniformly random in Z p x Z p ; a' (or 
b') is also uniformly random in Z p conditioned on any 
ix',y'). So, if a- b = f(x t y) - j, then 



Pia,b\x,y) 



which implies 



A £ Pia'-b' = fix',y')-j\x',y') 

P x',y'=0 
V-i 



P ' 



-1 

0=0 



p = E "iPj- 



(63) 



(64) 



Now recall that in the proof of lemma |H] we gave the 
following protocol that converts a P? to a PR p : Alice 
inputs x and Bob inputs y, and suppose they receive a 
and b. Their final outputs are a 1 = A _1 (a — gix)) and 
V = X^ib + hiy)). Note 



a'-b'-xy = X x (a - 6 - fix, y)). 



(65) 



So this protocol converts a p/ to a PR pX -ij- Hence, we 
have 



E '',/'; ^E'^/'^.x ,• 



J=0 



J=0 



Combining Eq. ( 56 1 and ( 66 ) , we obtain 



p^ E VjPRp,x-> 

J=0 



(66) 



(67) 



ar = a; + a 



(57) Then by theorems 15 and 16 we get 
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Theorem 18. In any world where information causality 
holds, for any additively inseparable function / : Z p x 
Z p — > Z p with A(/) = 2 and any p-nary-input, p-nary- 
output box P , we have: 

• VA < P , Vc e Zp, 



1 p-i p-i AT — 1 

- E <(E ^-O^ 1 < e-^- — log 2 p); (68) 
P k=0 j=o ' JV 



Vn > I, N = p n , Vc e Zp, 



1 p-l p-l ]\T _ 1 

- E <( E ^-wf < 0- H— — lo g2 p), (69) 

P fc=0 j=0 JV 



where Vj is defined as Eq.(54) 



Case 2: A(/) > 2. 

Given arbitrary p-nary-input, p-nary-output box P, we 
change it slightly as follows: Alice and Bob generate a 
uniformly random 7 £ Z p , and modify their outputs by 
a — > a + 7 and 6 — > + 7. Suppose this modified box is 
P'. They we have 



P'{a,b\x,y) 



1 



x,y,f(.x,v)-a+b 



where 



E 



x,V,3 



Define 



P(a -b = f(x,y) -j\x,y) 
p-i 

P(a = k,b=k - f(x, y) + j\x, y). 

fe=0 



Vi = min E x v a . 



Then we have 



p ' = X? VjPj + (1 - E 1 v i) p " 

j=0 j=0 



(70) 



(71) 



(72) 



(73) 



for some box P". 

Now recall that in the proof of lemma[5J we have given 
a recursive protocol that transforms M = 2 A " copies 
of Pf into a copy of PR p . Let us see what happens if we 
apply that protocol to P' . Suppose the resulting box is 
P. Note that P' acts as Pj with probability Vj (or acts 

p-l 

as P" with probability 1 — E ^i)- So each time Alice 

3=0 

and Bob access a P' , their outputs are basically the same 
as those of Pf except for an additive shift which is j with 

P-i 

probability i/j (or with probability 1 — E tlie- outputs 

3=0 

are nonsense). These additive shifts are multiplied by a 
factor A -1 for some A 7^ (see Eq.(66)). And the overall 



multiplied shifts minus the sum of the other half, since 
we use a difference method. So, 



p= Ert + (i- Eh)p" 

3=0 3=0 



where 



l'j 



M 

e n 

(jit— Jm)£Sm,3 k=1 



(74) 



(75) 



in which 



S M ,j = {(iii • • • ! 3 m) 6 Zp ■ E 3k - E 3k = Aj} 

fc=i fe=^+i 

(76) 

and P'" is some box. 

Then, by lemma [l2j we have 



P^P= E 



(77) 



for some > fj,j. 

Now assume Alice and Bob execute 
basicRAC(p,7V,c,P,f,y) for N < p. Then Bob's 
guess is correct with probability 

N-i 

X P {%N - l,c) = E II ft k 

jiH hjw-i=— <= fe=l 

AT-1 

> e n m« 

jiH hjjv-i=— c fe=l 



E 

0'i:---Jm(jv-i))S'S M ( J v-i), 

<7p(i7,M(JV-l),c), 



M(JV-l) 

n - 

fe=i 



where in the second step we use /7J > /ij > 0, and in the 
third step we use Eqs.(75l and (76 1, and in the last step 



v = (uq, v\, . . . , Vp-i). So we must have 



a p {v,M{N-l),c) < Q-^E—Liog^ (79) 



otherwise condition ( 34 ) is violated 



Similarly, if Alice and Bob execute 
recursiveRAC(p, N, c, P, x, y) for N = p n , then Bob's 
guess is correct with probability 

X P {% n(p ~ 1), c) > crp(^, Mn(p - 1), c), (80) 
So unless 

a p (?,Mn(p-l),c) < e- 1 (^^log 2 p), (81) 



shift in the final output is the sum of a half of these condition ( 34 ) is violated. 
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Note that VX > 1, Vc G Z B 



a p (u,L,c) = E II v 3 k 

(ju-,jL)es L ,- e k=i 

P j=0 j=0 

= ^E< fe ( P E^<)*(E 1 ^ ife )* 



P k=a 



3=0 



3=0 



(82) 



where in the second step we use tr(21) = 0, Vi = 
1,2, . . . ,p — 1. So we have 

Theorem 19. In any world where information causality 
holds, for any additively inseparable function f : Z p X 
Z p — > Z p wif/i A(/) > 2 and any p-nary-input, p-nary- 
output box P , we have: 

• VN <p, L = {N - 1)2 A W')- 2 , Vc G Zp, 



1 p— 1 p—1 p— 1 AT "1 

- E <(£ ^ P fe )*(E ^V fc )* ^ e-H^r^iogap); 

P fc=0 j=0 j=0 JV 

(83) 



Vn > 1, N = p n , L = n{p- 1)2 A W)- 2 , Vc G Z p , 



1 P-l P-l r P-l r /V — 1 

- E <(E ^<ME KiV fc ) T < e-H^r^iogap), 

(84) 



where Vj is defined as Eqs.(71) and (72). 



Remark 20. Since quantum correlations satisfy the 
principle of information causality, theorems [l5j [l6| [18] 
and [19] all apply to them. 



distributed computation and make communication com- 
plexity trivial. So all such functional boxes are unlikely 
to exist. Furthermore, we investigated how proximate 
can a general box be to these functional boxes without 
violating the principle of information causality. 

Our work raises many new questions: 

First, we have shown that if PR P box can be imple- 
mented exactly, it would lead to the collapse of determin- 
istic communication complexity. But we do not know 
how much noise it can tolerate while still making prob- 
abilistic communication complexity trivial. And what 
about general PH 

Second, in the proof of lemma [8j we gave a protocol 
that transforms 2 «J copies of P f into a PR p . That 
protocol is universal, but might be not optimal for some 
/. If we can simulate a PR P with fewer copies of P* , then 
the bounds in theorem [19] can be improved accordingly. 
In fact, can we directly use P? to perform distributed 
computation, instead of first converting it into PR p ? 

Third, as pointed out in Ref.[T7], the set of physi- 
cally allowed boxes should form a closed set under local 
wirings. Namely, if a set of boxes P-l,P%, . . . , P m are all 
allowed by a physical theory, then a new box P obtained 
by locally connecting these boxes should also be allowed 
by this theory. Conversely, if P is not allowed, then at 
least one of P 1 ,P 2 , . . . , P m should be prohibited. Since 
we have already obtained a set of implausible postquan- 
tum correlations, can we use this approach to rule out 
more? 

Finally, here we have only considered bipartite p-nary- 
input, p-nary-output boxes. It would be interesting to 
extend our results to more general boxes with arbitrary 
number of inputs and outputs, as well as multipartite 
boxes. 



IV. CONCLUSION 



In summary, we have proposed the class of functional 
boxes which incorporate the generalized PR boxes as a 
special case. Every functional box corresponding to an 
additively inseparable function is asymptotically equiva- 
lent to a generalized PR box, which can enable perfect 
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